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Using a weak migration and weak mutation approximation, I study the average waiting time to 
and the average duration of parapatric speciation. The description of reproductive isolation used is 
based on the classical Dobzhansky model and its recently proposed multilocus generalizations. The 
dynamics of parapatric speciation is modeled as a biased random walk with absorption performed 
by the average genetic distance between the residents and immigrants. If a small number of genetic 
^ ! changes is sufficient for complete reproductive isolation, mutation and random genetic drift alone 
can cause speciation on the time scale of 10-1000 times the inverse of the mutation rate. Even 
relatively weak selection for local adaptation can dramatically decrease the waiting time to specia- 
tion. The duration of parapatric speciation is shorter by orders of magnitude than the waiting time 
to speciation. For a wide range of parameter values, the duration of speciation is order one over 
the mutation rate. In general, parapatric speciation is expected to be triggered by changes in the 
' environment. 
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Parapatric speciation is usually defined as the process of species formation in the presence of some 
gene flow between diverging populations. From a theoretical point of view, parapatric speciation 
represents the most general scenario of speciation which includes both allopatric and sympatric spe- 
ciation as extreme cases (of zero gene flow and a very large gene flow, respectively). The geographic 
structure of most species, which are usually composed of many local populations experiencing little 
genetic contact for long periods of time (Avise 2000), fits the one implied in the parapatric speciation 
scenario. In spite of this, parapatric speciation has received relatively little attention compared to 
a large number of empirical and theoretical studies devoted to allopatric and sympatric modes (but 
see Ripley & Beehler 1990; Burger 1995; Friesen & Anderson 1997; Rolan-Alvarez et al. 1997; Frias 
& Atria 1998; Macnair & Gardner 1998). Traditionally, studies of parapatric speciation emphasized 
the importance of strong selection for local adaptation in overcoming the homogenizing effects of 
migration (e.g. Endler 1977; Slatkin 1982). Recently, it has been shown theoretically that rapid 
parapatric speciation is possible even without selection for local adaptation if there are many loci 
affecting reproductive isolation and mutation is not too small relative to migration (Gavrilets et al. 
1998, 2000a; Gavrilets 1999). 
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Earlier theoretical studies of speciation mostly concentrated on the accumulation of genetic 
differences that could eventually lead to complete reproductive isolation. However, within the 
modeling frameworks previously used complete reproductive isolation was not possible (but see Nei 
et al. 1983; Wu 1985). Recently new approaches describing the whole process of speciation from 
origination to completion have been developed and applied to allopatric (Orr 1995; Orr & Orr 1996; 
Gavrilets & Hastings 1996; Gavrilets & Boake 1998; Gavrilets 1999), parapatric (Gavrilets 1999; 
Gavrilets et al. 1998, 2000a) and sympatric (e.g. Turner & Burrows 1995; Gavrilets & Boake 1998; 
van Doom et al. 1998; Dieckmann & Doebeli 1999) scenarios. Here, I develop a new stochastic 
approach to modeling speciation as a biased random walk with absorption. I use this framework 
to find the average waiting time to and the average duration of parapatric speciation. My results 
provides insights into a number of important evolutionary questions about the role of different 
factors (such as mutation, migration, random genetic drift, selection for local adaptation, genetic 
architecture of reproductive isolation) in controlling the time scale of parapatric speciation. 

The method for modeling reproductive isolation adapted below is based on the classical Dobzhan- 
sky model (Dobzhansky 1937) discussed in detail in a number of recent publications (e.g. Orr 1995; 
Orr & Orr 1996; Gavrilets & Hastings 1996; Gavrilets 1997). The Dobzhansky model as originally 
described has two important and somewhat independent features (Orr 1995). First, the Dobzhansky 
model suggests that in some cases reproductive isolation can be reduced to interactions of "comple- 
mentary" genes (that is genes that decrease fitness when present simultaneously in an organism). 
Second, it postulates the existence of a "ridge" of well-fit genotypes that connects two reproduc- 
tively isolated genotypes in genotype space. This "ridge" makes it possible for a population to evolve 
from one state to a reproductively isolated state without passing through any maladaptive states 
("adaptive valleys"). The original Dobzhansky model was formulated for the two- locus case. The 
development of multilocus generalizations has proceeded in two directions. A mathematical theory 
of the build up of incompatible genes leading to hybrid sterility or inviability was developed by Orr 
(1995; Orr & Orr 1996) who applied it to allopatric speciation. A complementary approach placing 
the most emphasis on "ridges" rather than on "incompatibilities" was advanced by Gavrilets (1997, 
1999; Gavrilets & Gravner 1997; Gavrilets et al. 1998, 2000a6). This approach makes use of a recent 
discovery that the existence of "ridges" is a general feature of multidimensional adaptive landscapes 
rather than a property of a specific genetic architecture (Gavrilets & Gravner 1997; Gavrilets 1997, 
2000). Here, I will use the "ridges-based" approach assuming that mating and the development of 
viable and fertile offspring is possible only between the organisms that are not too different over 
a specific set of loci responsible for reproductive isolation. The adaptive landscape arising in this 
model is an example of "holey adaptive landscapes" (Gavrilets & Gravner 1997; Gavrilets 1997, 
2000) of which the original two-locus two-allele Dobzhansky model is the simplest partial case. My 
general results are directly applicable to the original Dobzhansky model. 

2. MODEL 

I consider a finite population of sexual diploid organisms with discrete non-overlapping generations. 
The population is subject to immigration from another population. For example, one can think 
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of a peripheral population (or an island) receiving immigrants from a central population (or the 
mainland). All immigrants are homozygous and have a fixed "ancestral" genotype. Mutation 
supplies new genes in the population some of which may be fixed by random genetic drift and/or 
selection for local adaptation. Migration brings ancestral genes which, if fixed, will decrease genetic 
differentiation of the population from its ancestral state. 

In this paper, I consider only the loci potentially affecting reproductive isolation. The degree 
of reproductive isolation depends on the extent of genetic divergence at these loci. Let d be the 
number of loci at which two individuals differ. I posit that the probability, w, that two individuals 
are able to mate and produce viable and fertile offspring is a non-increasing function of d such that 
w(0) = 1 and w(d) = for all d > K where K is a parameter of the model specifying the genetic 
architecture of reproductive isolation. This implies that individuals with identical genotypes at the 
loci under consideration are completely compatible whereas individuals that differ in more than 
K loci are completely reproductively isolated. A small K means that a small number of genetic 
changes is sufficient for completely reproductive isolation. A large K means that a significant 
genetic divergence is necessary for completely reproductive isolation. If K is equal to the overall 
number of loci, complete reproductive isolation is impossible (neutral case). This simple model 
is appropriate for a variety of isolating barriers including premating, postmating prezygotic, and 
postzygotic (Gavrilets et al. 1998, 2000a6, Gavrilets 1999). I will allow the loci responsible for 
reproductive isolation to have pleiotropic effects on the degree of adaptation to the local environment 
(Gavrilets 1999; cf. Slatkin 1981; Rice 1984; Rice & Salt 1988). Specifically, I will assume that each 
new allele potentially has a selective advantage s (> 0) over the corresponding ancestral allele in 
the local environment. 

I will use a weak mutation and weak migration approximation (e.g. Slatkin 1976, 1981; Lande 
1979, 1985; Tachida & Iizuka, 1991; Barton 1993) neglecting within-population variation. Under 
this approximation the only role of mutation and migration is to introduce new alleles which quickly 
get fixed or lost. I will assume that the processes of fixation and loss of alleles at different loci are 
independent. Within this approximation, the relevant dynamic variable is the number of loci, -D&, 
at which a typical individual in the population is different from the immigrants. Variable D\, is 
the average genetic distance between residents and immigrants computed over the loci underlying 
reproductive isolation. The dynamics of speciation will be modeled as a random walk performed by 
D b on a set of integers 0, 1, . . . , K, K + 1. In what follows I will use Aj and ^ for the probabilities 
that Db changes from % to % + 1 or % — 1 in one time step (generation). The former outcome occurs 
if a new allele supplied by mutation gets fixed in the population. The latter outcome occurs if 
an ancestral allele brought by immigrants replaces a new allele previously fixed. I disregard the 
possibility of more than one substitution in one time step. Probabilities \ and ^ are small and 
depend on the rate of migration, m, the rate of mutation per gamete per generation, v, the strength 
of selection for local adaptation, s, and the population size, N. Speciation occurs when d hits the 
(absorbing) boundary K + 1. If this happens, the population is completely reproductively isolated 
from the ancestral genotypes. I do not consider the possibility of backward mutation towards an 
ancestral state. Fixing new alleles at K + 1 loci completes the process of speciation. 
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3. RESULTS 



I will compute two important characteristics of the speciation process. The first is the average 
waiting time to speciation, t , defined as the average time to reach the state of complete reproductive 
isolation (D b = K + 1) starting at the ancestral state (D b = 0). In general, during the interval from 
t — to the time of speciation the population will repeatedly accumulate a few substitutions only 
to lose them and return to the ancestral state at D b = 0. The second characteristic is the average 
duration of speciation, T , defined as the time that it takes to get from the ancestral state (D b = 0) 
to the state of complete reproductive isolation (D b = K + l) without returning to the ancestral state. 
[The duration of speciation is similar to the conditional time that a new allele destined to be fixed 
segregates before fixation.] 

(a) ALLOPATRIC SPECIATION 

It is illuminating to start with the case of no immigration (cf. Orr 1985; Orr & Orr 1996; Gavrilets 
1999, pp. 6-8). In this case, the process of accumulation of new mutations is irreversible and the 
average duration of speciation, T , is equal to the average waiting time to speciation, to- 

No selection for local adaptation. With no or very little within-population genetic variation the 
process of accumulation of substitutions leading to reproductive isolation is effectively neutral (cf., 
Orr 1995; Orr & Orr 1996). The average number of neutral mutations fixed per generation equals 
the mutation rate v (Kimura 1983). Thus, the average time to fix K + 1 mutations is 

Selection for local adaptation. In a diploid population of size N, the number of mutations per 
generation is 2Nv. The probability of a mutant allele with a small selective advantage s being 
fixed is approximately 2s/(l — exp{— ANs)) (Kimura 1983). Thus, the average time to fix K + 1 
mutations is 

K+l I- exp(-S) 

to a ' y z > 

v S 

where S = 4Ns. With S increasing from to, say, 10, the time to speciation to decreases to 
approximately 1/10 of that in the case of no selection for local adaptation. 



(b) PARAPATRIC SPECIATION 

With immigration, the dynamics of D b are controlled by two opposing types of forces. Mutation and 
selection act to increase D b whereas migration acts to decrease D b . The appendix presents exact 
formulae for t and T in the case of parapatric speciation. Below I give some simple approximations 
valid if ™ exp(-S) is not too small. 
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Threshold function of reproductive compatibility. Here, I assume that the function w{d) 
specifying the probability that two individuals are not reproductively isolated has a threshold form: 

, / 1 for d < K, ( s 

W ^ = { for d > K. ® 

(Gavrilets et al. 1998, 2000a6; Gavrilets 1999, cf. Higgs and Derrida 1992). This function implies 
that immigrants have absolutely no problems mating with the residents unless the genetic distance 
Db exceeds K. I start with the worst-case scenario for speciation when not only immigrants can 
easily mate with residents but also selection for local adaptation is absent (cf., Gavrilets et al. 1998, 
2000a; Gavrilets 1999). 
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Table 1: Exact expressions for the average waiting time to speciation, t^, and the 
average duration of speciation, T *, for small K with no selection for local adaptation 
and a threshold function of reproductive compatibility. R = m/v. 

No selection for local adaptation. With no selection for local adaptation and neglecting within- 
population genetic variation, the process of fixation is approximately neutral. The probability of 
fixation of an allele is equal to its initial frequency. The average frequency of new alleles per 
generation is approximately the mutation rate v. If the immigrants differ from the residents at 
Dt = i loci, there are % loci that can fix ancestral alleles brought by migration. The average 
frequency of such alleles per generation is im. Thus, the probabilities of stochastic transitions 
increasing and decreasing Df, by one are approximately 

\i = v, \ii = i m. (4) 

With small K, the exact expressions for t and T found in the Appendix are relatively compact (see 
Table 1). With larger K, the approximate equations are more illuminating. The average waiting 
time to speciation is approximately 

t;*-(-) K K\ (5) 



v \ v 
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A. B. 




m/v m/v 

Figure 1: The average waiting time to speciation, £q, and the average duration of 
speciation, T *. The x-axes give the ratio of migration and mutation rates. In Figure 
(A), the y-axis gives the product of tj$ and the mutation rate v on the logarithmic 
scale. In Figure (B), the y-axis gives the product of T * and the mutation rate v on 
the linear scale. Solid lines correspond to the exact values (found in the Appendix); 
dashed lines correspond to the approximate equations (|5]) and (|6|). In Figure (B), the 
lines correspond to K = 1,2,4,6,8, and 10 (from bottom to top). 



The average duration of speciation is approximately 

1/^+1)+^ 

v \ m/v J 

where 7 ~ 0.577 is Euler's constant and is the psi (digamma) function (Gradshteyn & Ryzhik, 
1994). [Function ^>{K + 1) + 7 slowly increases with K and is equal to 1 at K — 1, to 2.93 at 
K = 10 and to 5.19 at K = 100.] For example, if m = 0.01, v = 0.001 and K = 5, then the 
waiting time to speciation is very long: £q w 1.35 x 10 10 generations, but if speciation does hap- 
pen, its duration is relatively short: T * ps 1236 generations. Figure 1 illustrates the dependence of 
to and T * on model parameters. Notice that Tq is order 1/v across a wide range of parameter values. 

Selection for local adaptation. Assume that "new" alleles improve adaptation to the local con- 
ditions. Let s be the average selective advantage of a new allele over the corresponding ancestral 
allele. Each generation there are 2Nv such alleles supplied by mutation. The probability of fixation 
of an advantageous allele is approximately 2s/ [1 — exp(—4Ns)}. Migration brings approximately 




Figure 2: Effects of selection for local adaptation and of linear function of reproductive 
compatibility on the waiting time to speciation. (A) The proportion (on the logarithm 
scale) by which selection for local adaptation decreases t (Fx = exp(-KS) 1 ~ eg: g(~ s ) • 
see equation |8|). (B) The proportion (on the logarithm scale) by which t is reduced 
relative to £q if the function of reproductive compatibility is linear (F2 = v^r exp(-K); 
see equation O). 



2Nmi ancestral alleles at the loci that have previously fixed new alleles. These alleles are delete- 
rious in the new environment. The probability of fixation of a deleterious allele is approximately 
2s/[exp(4:Ns) — 1] (Kimura 1983). Thus, the probabilities of stochastic transitions increasing and 
decreasing by one are approximately 

} y 4Ns lH = im ANs (7) 

1 — exp(-ANs) ' exp(4Ns) — 1 

The waiting time to speciation is approximately 

to « to exp(-KS) (8) 
where £j$ is given by equation (|^). The average duration of speciation is approximately 

roM i( 1 + »(«;i>^ ) i^zg). (9) 

v \ {m/v)e~ b J S 

For example, if m = 0.01, v = 0.001, K = 5 and S = 2, then to ~ 2.74 x 10 4 generations and 
T = 2170 generations. Thus, selection for local adaptation dramatically decreases to (in the nu- 
merical example, by the factor w 50,000) and somewhat increases T relative to the case of speciation 
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driven by mutation and genetic drift. Figure 2A illustrates the effect of selection for local adapta- 
tion on to in more detail. 

Linear function of reproductive compatibility. Here, I assume that the probability of 
no reproductive isolation decreases linearly with genetic distance d from one at d = to zero at 

~{ 1H/( * +1, £i>£ (io) 

Now, immigrants experience problems in finding compatible mates even when the genetic distance 
is below K + 1. 



No selection for local adaptation. With no selection for local adaptation, the probabilities of 
stochastic transitions Aj and /ij are given by equations (f|) with m substituted for an "effective" 
migration rate 



rrii — m 1 . (11) 

The waiting time to speciation is approximately 

t « t* exp{-K), (12) 
where tj$ is given by equation (H). The average duration of speciation is approximately 

ro ,l( 1 + 2 *(*+/ )+7 ). (13) 

v y m/v J 

where 7 is Euler's constant and \l/(.) is the psi (digamma) function. The last equation differs from 
equation (|6|) only by the factor 2 inside the parentheses. For example, with the same values of 
parameters as above t « 2.36 x 10 8 generations and T = 1470 generations. Thus, to is significantly 
reduced (by the factor 57) whereas To is somewhat larger than in the case of threshold function 
of reproductive compatibility. Figure 2B illustrates the effect of linear function of reproductive 
compatibility on to in more detail. 



Selection for local adaptation. With selection for local adaptation, the probabilities of stochastic 
transitions A, and /Xj are given by equations (0) with m substituted for an "effective" migration rate 
(frrD- The average time to speciation is approximately 

to w to v 7 ^ exp{-K) exp{-KS) 1 ~ ex ^~ S \ ( 14 ) 

The average duration of speciation T is given by equation (P) with an additional factor 2 placed in 
front of the ratio in the parentheses. As before, selection for local adaptation substantially decreases 
to and slightly increases T Q . 
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4. DISCUSSION 



The results presented above allow one to get insights about the time scale of parapatric speciation 
driven by mutation, random genetic drift and/or selection for local adaptation. I start the discussion 
of these results by considering the original Dobzhansky model. 

(a) Two-locus two-allele Dobzhansky model 

Dobzhansky's original model (Dobzhansky 1937) describes a two-locus two-allele system where a 
specific pair of alleles is incompatible in the sense that the interaction of these alleles "produces 
one of the physiological isolating mechanisms" (p. 282). Let us assume that the immigrants have 
ancestral haplotype ab and that the derived allele B is incompatible with the ancestral allele a 
(see Figure 4). In this case, the population can evolve to a state reproductively isolated from the 
ancestral state via a state with haplotype Ab fixed: ab — > Ab — > AB. [Recently Johnson et at. 
(2000) considered the probability of parapatric speciation driven by mutation in a somewhat similar 
model. However, their major equation (eq. 13) is heuristic and does not appear to be justified.] Let 
v be the probability of mutation from an ancestral allele (a and b) to the corresponding derived 
allele (A or B). The average waiting time to and the average duration of parapatric speciation in 
this system are given by our general equations with K — 1 and v — v. Allowing for equal selective 
advantage s of derived alleles over the ancestral alleles, 



where S = 4Ns and the approximations are good if {m/v)exp{—S) » 1. With no selection for 
local adaptation (that is if S = 0), to ~ rn/v 2 ,TQ ps \ jv. 

Let m = 0.01 and v = 10~ 5 . Then with no selection for local adaptation, the average waiting 
time to speciation is very long: t 10 s generations, and T ss 10 5 generations. However, even 
with relatively weak selection for local adaptation, t can decrease by 1-2 orders of magnitude. For 
example, with S = 1, t « 2.34 x 10 7 , T ps 6.34 x 10 4 ; with S = 2, t « 5.94 x 10 6 , T ps 4.36 x 10 4 ; 
and with S = 3, t ps 1.64 x 10 6 , T ^ 3.23 x 10 4 . Because the waiting time to speciation in 
the two-locus Dobzhansky model scales as one over the mutation rate per locus squared, this time 
is rather long. However, the overall number of loci involved in the initial stages of reproductive 
isolation is at least on the order of tens to hundreds (e.g. Singh 1990; Wu & Palopoli 1994; Coyne 
& Orr 1998; Naveira and Masida 1998). This increases the overall mutation rate and can make 
speciation much more rapid. 




(15a) 



(2u + me" s )(l - e~ 5 ) _ 1 1 - e 
vS{v + me~ 5 ) v S 
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Figure 3: Adaptive landscape in the two-locus two-allele Dobzhansky model. Alleles a 
and B are incompatible. The arrows specify the chain of gene substitutions leading to 
complete reproductive isolation. 

(b) Average waiting time to parapatric speciation 

In the models studied here, reproductive isolation is a consequence of cumulative genetic divergence 
over a set of loci potentially affecting mating behavior, fertilization processes, and/or offspring vi- 
ability and fertility. The underlying biological intuition is that organisms that are reproductively 
compatible should not be too different genetically. Most species consist of geographically struc- 
tured populations, some of which experience little genetic contact for long periods of time (Avise 
2000). Different mutations are expected to appear first and increase in frequency in different pop- 
ulations necessarily resulting in some geographic differentiation even without any variation in local 
selection regimes. An interesting question is whether mutation and drift alone are sufficient to 
result in parapatric speciation. This question is particularly important given a growing amount of 
data suggesting that rapid evolution of reproductive isolation is possible without selection for local 
adaptation involved (e.g. Vacquier 1998; Palumbi 1998; Howard 1999). Our results provide an 
affirmative answer to this question (see also Gavrilets et al. 1998, 2000a; Gavrilets 1999). However, 
here the waiting time to speciation is relatively short only if a very small number of genetic changes 
is sufficient for complete reproductive isolation. For example, to is on the order of 10-1000 times the 
inverse of the mutation rate if K = 1 or 2 with a threshold function of reproductive compatibility, 
and if K = 1, 2 or 3 with a linear function of reproductive compatibility. It is well recognized that 
selection for local adaptation can result in speciation in the presence of some gene flow (e.g. Slatkin 
1981; Rice 1984; Rice & Salt 1988; Schluter 1998). Our results show that even relatively weak 
selection can dramatically reduce the waiting time to speciation by orders of magnitude (see Figure 
2a). 
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(c) How much migration prevents speciation? 

In general, evolutionary biologists accept that very small levels of migration are sufficient to prevent 
any significant genetic differentiation of the populations not to mention speciation (e.g. Slatkin 
1987, but see Wade & McCauley 1984). To a large degree, this belief appears to be based on two 
observations. One is that the expected value of the fixation index F$t is small even with a single 
migrant per generation (e.g. Hartl & Clark 1997). Another is that the expected distribution of 
allele frequency in the island model changes from a U-shaped (which implies at least some genetic 
differentiation) to a bell-shaped (which implies no genetic differentiation on average) as the average 
number of migrants become larger than one per generation (e.g. Crow & Kimura 1970). However, 
the equilibrium expectations derived under neutrality theory can be rather misleading if there is a 
possibility for evolving complete reproductive isolation. For example, in the model with no selection 
for local adaptation considered above the expected change per generation in the genetic distance 
D b between the immigrants and residents is 

AD b = v — mD b , 

where the first term describes an expected increase in D b because of new mutations and the second 
term describes an expected decrease in D b because of the influx of ancestral genotypes. This 
equation predicts that D b will reach an equilibrium value of v/m. From this one can be tempted 
to conclude that unless the migration rate is smaller than that of mutation (v > m), D b cannot be 
larger than one and, thus, no speciation is possible. However, this argument is flawed. Because of 
the inherent stochasticity of the system there is always a non-zero probability of D b moving any 
pre-specified distance from which will lead to reproductive isolation. 

Strictly speaking, in the models studied here migration does not prevent but rather delays speci- 
ation. [The resulting delay can be substantial and for all practical reasons infinite.] For definiteness, 
I will say that speciation is effectively prevented if the average waiting time to speciation is larger 
than 1000 times the inverse of the mutation rate (that is if log 10 (vt ) > 3). If the number of genetic 
substitutions necessary for speciation is small (for example, K — 1, as in the original Dobzhansky 
model, or K — 2), then migration rates higher than 10t> will effectively prevent speciation in the 
absence of selection for local adaptation. For example, if v = 10™ 3 , then speciation is possible with 
m as high as 0.01. However, if v = 10~ 5 , then any migration rate higher than 0.0001 will effectively 
prevent speciation. If the number of genetic changes required for speciation is relatively large, say, 
if K = 10, then without selection for local adaptation speciation is effectively prevented (see Fig. 
4a). However, relatively weak selection, say with S = 2.5 would overcome migration rates as high 
as lOv if the strength of reproductive isolation increases linearly with genetic distance (see Fig.4b). 

Within the modeling framework used, all immigrants had a fixed genetic composition which 
did not change in time. Alternatively, one can imagine two populations exchanging migrants as- 
suming that both populations can evolve. If there is no selection for local adaptation, this case is 
mathematically equivalent to that studied above but with the mutation rate being twice as large 
as in the case of a single evolving population. Therefore, the waiting time to speciation in the 
two population case will dramatically decrease relative to that in the single population case. The 
maximum migration rates compatible with speciation will be twice as large as before. 
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Figure 4: Waiting time to speciation with selection for local adaptation for K = 10. 
Different lines correspond to S = 0.0,0.5,1.0,1.5,2.0,2.5, and 3.0 (from top to bottom). 
(A) Threshold function of reproductive compatibility (equation |3]). (B) Linear function 
of reproductive compatibility (equation |10|). 

(c) The role of environment 

The waiting time to speciation, t 0; is extremely sensitive to parameters: changing a parameter by a 
small factor, say two or three, can increase or decrease to by several orders of magnitude. Looking 
across a range of parameter values, to is either relatively short (if the parameters are right) or 
effectively infinite. Most of the parameters of the model (such as the migration rate, intensity of 
selection for local adaptation, the population size, and, probably, the mutation rate) directly depend 
on the state of the environment (biotic and abiotic) the population experiences. This suggests that 
speciation can be triggered by changes in the environment (cf. Eldredge 2000). Note that the time 
lag between an environmental change initiating speciation and an actual attainment of reproductive 
isolation can be quite substantial as our model shows. If it is an environmental change that initiates 
speciation, the populations of different species inhabiting the same geographic area should all be 
affected. In this case, one expects more or less synchronized bursts of speciation in a geographic 
area - that is a "turnover pulse" (Vrba 1985). 

(d) Average duration of parapatric speciation 

In our model, the average waiting time to and the average duration of allopatric speciation are 
identical. Previously, Lande (1985) and Newman et al. (1985) studied how an isolated population 
can move from one adaptive peak to another by random genetic drift. They showed that the 
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average duration of stochastic transitions between the peaks is much shorter than the time that the 
population spends in a neighborhood of the initial peak before the transition. Within the framework 
used by these authors stochastic transitions are possible in a reasonable time only if the adaptive 
valley separating the peaks is shallow. This implies that reproductive isolation resulting from a 
single transition is very small. Potentially, strong or even complete reproductive isolation (that is 
speciation) can result from a series of peak shifts along a chain of "intermediate" adaptive peaks 
such that each individual transition is across a shallow valley but the cumulative effect of many 
peak shifts is large (Walsh 1982). In this case, the results of Lande (1985) and Newman et al. (1985) 
actually imply that the population will spent a very long time at each of the intermediate adaptive 
peaks. This would lead to a very long duration of allopatric speciation that is in fact comparable 
to the overall waiting time to speciation. 

For parapatric speciation, the predictions are very different. Our results about the duration 
of speciation lead to three important generalizations. The first is that the average duration of 
parapatric speciation, T , is much smaller than the average waiting time to speciation, t . This 
feature of the models studied here is compatible with the patterns observed in the fossil record 
which form the empirical basis of the theory of punctuated equilibrium (Eldredge 1971; Eldredge 
& Gould 1972). The second generalization concerns the absolute value of T . The waiting time to 
speciation changes dramatically with slight changes in parameter values. In contrast, the duration 
of speciation is on the order of one over the mutation rate over a subset of the loci affecting 
reproductive isolation for a wide range of migration rates, population sizes, intensities of selection 
for local adaptation, and the number of genetic changes required for reproductive isolation. Given 
a "typical" mutation rate on the order of 10~ 5 — 10 -6 per locus per generation (e.g. Griffiths et al. 
1996; Futuyma 1997) and assuming that there are at least on the order of 10-100 genes involved in 
the initial stages of the evolution of reproductive isolation (e.g. Singh 1990; Wu & Palopoli 1994; 
Coyne & Orr 1998; Naveira & Masida 1998), the duration of speciation is predicted to range between 
10 3 and 10 5 generations with the average on the order of 10 4 generations. The third generalization 
is about the likelihood of situations where strong but not complete reproductive isolation between 
populations is maintained for an extended period of time (much longer than the inverse of the 
mutation rate) in the presence of small migration without the populations becoming completely 
isolated or completely compatible. Judging from our theoretical results, such situations appear to 
be extremely improbable. 

(e) Validity of the approximations used 

The results presented here are based on a number of approximations the most important of which 
is the assumption that within-population genetic variation in the loci underlying reproductive iso- 
lation can be neglected. A biological scenario to which this assumption is most applicable is that of 
a small (peripheral) populations with not much genetic variation maintained and with occasional 
influx of immigrants from the main population. [ Note that within-population genetic variation 
in the loci underlying reproductive isolation has to be manifested in reproductive incompatibili- 
ties between some members of the population. However, the overall proportion of incompatible 
mating pairs within the population is not expected to be large (e.g. Wills 1977; Nei et al. 1983; 
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Gavrilets 1999).] Intuitively, one might expect that increasing within-population variation would 
substantially increase the rate of substitutions by random genetic drift and make speciation easier. 
However, in polymorphic populations the alleles affecting the degree of reproductive isolation can- 
not be treated as neutral because they are weakly selected against than rare (Gavrilets et al. 1998, 
2000a; Gavrilets 1999). In the absence of selection for local adaptation this might make speciation 
somewhat more difficult. Allowing for genetic variation among immigrants can increase the plau- 
sibility of speciation. For example, if new alleles are deleterious in the ancestral environment and 
are maintained there by mutation, their equilibrium frequency will be order v/s*, where s* is the 
selection coefficient against new alleles in the ancestral environment. Thus, the overall frequency 
of new alleles in the population per generation will increase from v to approximately v + m-^. 
Intuitively, this can result in a substantial reduction in the waiting time to speciation. The overall 
effect of genetic variation (both within-population and among immigrants) on the waiting time to 
parapatric speciation has to be explored in a systematic way. This is especially important given 
that the individual-based simulations reported in Gavrilets et al. (1998, 2000a) show that rapid 
speciation is possible well beyond the domain of parameter values identified here as conducive to 
speciation. As for the duration of speciation, I expect it to have an order of one over the level of 
genetic variation maintained in the loci underlying reproductive isolation. As such, with genetic 
variation the duration of speciation is expected to be (much) shorter than 1/v. 

I am grateful to Janko Gravner for helpful suggestions and to Chris Boake for comments on the 
manuscript. This work is supported by National Institutes of Health grant GM56693 and by EPPE 
fund (University of Tennesee, Knoxville). 



APPENDIX 

Average waiting time till speciation. I consider a Markov chain with K+l states 0, 1, . . . , K, K+ 

1. Let pij be the corresponding transition probabilities. I assume that the state K + 1 is absorb- 
ing but the state is not. Let U be the average time till absorption starting from i. The mean 
absorption times satisfy to a general system of linear equations 

U = l + Y,Piitj (16) 

j 

for i = 0, 1, . . . , K with t^+i = (e.g. Norris 1997). I assume that the transition probabilities are 
Pi t i+i = \,Pi,i-i = Hi and — if \i — j\ > 1 with Hk+i = 0. In this case, the system of linear 
equations (^) can be solved by standard methods (e.g. Karlin & Taylor 1975). 

Let Zi — t i — From equation ( |T6l) with i = one finds an equality to — 1 + Aoti + (1 — Ao)io 
which can be rewritten as 

z = 1/Ao- (17a) 
In a similar way, for i > one finds an equality t j = 1 + Ajt i+ i + /Mi-i + (1 ~ A, — Hi)ti which can 
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be rewritten as 

Zi = y^-i + — . (17b) 

Aj Ai 

The solution of the system of linear recurrence equations ( O) is 



^ = f+Eir-. ( 18a ) 

where 

^ ~~ t~\ — r ( ibb J 

A\A2 • • • Aj 

with p = 1. One can also see that Yh=q z % — (t — ti) + (ti — t 2 ) + ■ ■ ■ (tx — tjc+i) = *o- Thus, t 
can be found by summing up equations ( |18a|) : 

T K o K i o- 

*o = ^ + EEf (19) 
Ao i= i j=1 AjPj 



The absorption times U corresponding to i > can be found recursively using equation (|17b| ). 
With a threshold function of reproductive compatibility @, 

Pj = R?j\, (20) 
where R = (m/v)exp(—S). With a linear function of reproductive compatibility fllCf ), 

"> = Rljt (K+iw-nv (21) 

Average duration of speciation. The average duration of speciation, To, can be defined 
as the average time that it takes to walk from state to state K + 1 without returning to state 
0. Ewens (1979, Section 2.11) provides formulae that can be used to find T . These formulae are 
summarized below. 

The probability of entering state K + 1 before state starting from i is 

*< = X>/X>i- (22) 

j=0 j=0 

Starting from state i, the mean time spent in state j before entering state or state K + 1 is 

j'-i 

t tj = (1 - 7r<) E Pfc/ (PjAj) for j = 1, . . . , z, (23a) 

fc=0 

k 

Uj = Pk/(Pj^j) for j = z + 1, . . . ,K (23b) 
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Starting from state i, the conditional mean time spent in state j for those cases for which the state 
K + 1 is entered before state is 

t* j = t ij ir j /Tr i . (24) 
The condition mean time till absorption in K + 1 is 

*,* = XX- ( 25 ) 

The average duration of speciation is the sum of the average time spent in state before moving 
to state 1, which is 1/Ao, plus the conditional mean time till absorption in K+l starting from state 
1, which is £*, 

r = l/Ao + tJ. (26) 
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